DeQue: A Lexicon of Complex Prepositions and Conjunctions in French

Authors

  • Carlos Ramisch
  • Alexis Nasr
  • André Valli
  • José Deulofeu

Abstract

We introduce DeQue, a lexicon covering French complex prepositions (CPRE) like à partir de (from) and complex conjunctions (CCONJ) like bien que (although). The lexicon includes fine-grained linguistic description based on empirical evidence. We describe the general characteristics of CPRE and CCONJ in French, with special focus on syntactic ambiguity. Then, we list the selection criteria used to build the lexicon and the corpus-based methodology employed to collect entries. Finally, we quantify the ambiguity of each construction by annotating around 100 sentences randomly taken from the FRWaC. In addition to its theoretical value, the resource has many potential practical applications. We intend to employ DeQue for treebank annotation and to train a dependency parser that takes complex constructions into account.
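
The ambiguity study mentioned in the abstract (sampling around 100 corpus sentences and annotating them) could be sketched roughly as follows. This is only an illustrative sketch, not the authors' pipeline: the function names `sample_occurrences` and `complex_reading_ratio` are hypothetical, the corpus is assumed to be a plain list of sentence strings, and summarising the annotations as a simple proportion is an assumption rather than the measure reported in the paper.

```python
import random
import re


def sample_occurrences(sentences, construction, k=100, seed=0):
    """Return up to k randomly chosen sentences that contain the construction.

    Hypothetical helper: `sentences` is assumed to be a list of strings,
    one sentence per item, already extracted from a corpus.
    """
    # Match the construction as a whole-word sequence, ignoring case.
    pattern = re.compile(
        r"(?<!\w)" + re.escape(construction) + r"(?!\w)", re.IGNORECASE
    )
    hits = [s for s in sentences if pattern.search(s)]
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    return rng.sample(hits, min(k, len(hits)))


def complex_reading_ratio(labels):
    """Share of annotated occurrences judged to be a complex function word.

    `labels` holds one boolean per annotated sentence (True means the
    sequence was annotated as a CPRE/CCONJ, False means an accidental
    co-occurrence of the same words).
    """
    if not labels:
        raise ValueError("no annotations provided")
    return sum(labels) / len(labels)


if __name__ == "__main__":
    toy_corpus = [
        "Bien que fatigué, il travaille.",   # conjunction reading
        "Je vois bien que tu es fatigué.",   # adverb 'bien' + 'que'
        "Il dort.",
    ]
    sample = sample_occurrences(toy_corpus, "bien que")
    manual_labels = [s.startswith("Bien que") for s in sample]  # stand-in for human annotation
    print(len(sample), complex_reading_ratio(manual_labels))
```

In this toy corpus, the word sequence bien que appears once as a conjunction and once as the adverb bien followed by que, which is the kind of ambiguity the annotation is meant to quantify.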

Related resources

PrepLex: A Lexicon of French Prepositions for Parsing

PrepLex is a lexicon of French prepositions which provides all the syntactic information needed for parsing. It was built by comparing and merging several authoritative lexical sources. This lexicon also includes information about the prepositions or classes of prepositions that appear in French verb subcategorization frames. This resource has been developed as a first step in making current Fr...

Ontology and Lexical Semantics for Generating Temporal Discourse Markers

In text, temporal relations between events can be signalled in several ways; among them are specific lexical items, here called temporal discourse markers. We analyse the semantics of about 20 German subordinating conjunctions and prepositions and transfer these findings to a sentence generation framework that uses a dedicated discourse marker lexicon for producing complex sentences. After discuss...

Attacking Parsing Bottlenecks with Unlabeled Data and Relevant Factorizations

Prepositions and conjunctions are two of the largest remaining bottlenecks in parsing. Across various existing parsers, these two categories have the lowest accuracies, and mistakes made have consequences for downstream applications. Prepositions and conjunctions are often assumed to depend on lexical dependencies for correct resolution. As lexical statistics based on the training set only are ...

Joint Dependency Parsing and Multiword Expression Tokenization

Complex conjunctions and determiners are often considered as pretokenized units in parsing. This is not always realistic, since they can be ambiguous. We propose a model for joint dependency parsing and multiword expression identification, in which complex function words are represented as individual tokens linked with morphological dependencies. Our graph-based parser includes standard secondo...

Towards Invariant Meanings Of Spatial Prepositions And Preverbs

This work presents a semantic analysis of two spatial prepositions and their associated prefixes: the French sur, sur- (on) and the Polish przez, prze- (across). We propose a theory of abstract places (loci) as a method of description that helps to build invariant meanings of the two linguistic units. Natural languages encode spatial and temporal representations in many var...

Publication date: 2016